The intermediate evolution phase in case of truncated selection. 
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Using methods of statistical physics, we present rigorous theoretical calculations of Eigen's quasispecies the- 
ory with the truncated fitness landscape which dramatically limits the available sequence space of a reproducing 
quasispecies. Depending on the mutation rates, we observe three phases, a selective one, an intermediate one 
with some residual order and a completely randomized phase. Our results are applicable for the general case of 
fitness landscape. 



I. INTRODUCTION 

Developing realistic evolution models poses an important 
challenge for evolution research yj, |2[]). After the seminal 
work by Eigenyl and successful experiments with the self- 
replication of macromoleculesflUt], several theoretical stud- 
ies attempted to explain this molecular evolution phenomenon 
[6-18]. Realistic fitness landscapes are not smooth, include 
neutral and lethal types, as observed in recent experimental 
studies with RNA viruses l20[|2Tll22ll . In the most of evolution 
articles symmetric fitness landscapes are considered, where 
the fitness is a function of the Hamming distance from the 
wild (reference) sequence. While solving evolution models, 
the vast majority of results for the mean fitness have been de- 
rived using uncontrolled approximations even for the symmet- 
ric fitness landscapes, with too simplified sequence space [23] 
and ignoring back mutations lfloll . A simplified geometry with 
only two (Hamming) classes for sequences with nonzero fit- 
ness was used in studies that investigate the role of lethal mu- 
tants in evolution! 24]. Furthermore, in most evolution mod- 
els the whole sequence space is assumed to be available for 
the evolving genome. However, the sequence space a lim- 
ited population can use is severely restricted to a small part of 
the sequence space surrounded by an unsurmountable moat of 
lethal mutations. In this paper, we attempt to rigorously solve 
this case of a truncated fitness landscape for symmetric fitness 
landscape. 



II. THE SYSTEM 

In Eigen 's theory JKH, an information carrier reproduces 
with a certain rate r^, producing offspring of the parental type 
with the probability Qu and offspring of another (mutant) type 
k with the probability Qki, 

dXijdt = {QuTj - TkXk (t)}Xj (t) + ^ QjkTkXk [t) (1) 
k k^i 

The whole sequence space contains M — 4 L different se- 
quences, where L is the genome length. The mean fitness of 



the system is R = J^k r kXkif) where Xi are the relative fre- 

4-L — 1 

quencies of the different genotypes (^2 k=0 x^ — 1). The 
master type has the maximal fitness ro, and its copying fi- 
delity is Q = Qqo — q L , where q is the average incorporation 
fidelity and L the chain length. It is convenient to work with 
the error rate u = L(l — q), leading to Q = e~ u . In this arti- 
cle we consider only the case L — ► oo, u is finite. Eigen maps 
each genotype precisely into a node on the L-dimensional hy- 
percube and has thus the correct connectivity for each type. 
The minimal number of steps leading from one position i in 
sequence space to another one, j, is the Hamming distance 
dji- 

In the evolution process the information content of the pop- 
ulation can be maintained only when the selection force is 
higher than the dissipating one (mutation). Otherwise, above 
the error threshold, the information gets lost. 

It has been shown that the system of nonlinear differential 
equations in Eq. Q] can be transformed to an infinite system 
of linear equations, connecting Eigen's model with statistical 
mechanics0,|^]- For a single peak fitness landscape (ro = A 
and Ti^o = IX the following condition for conserving the 
master sequence in the population holds Q Qii : 



AQ > 1 

where AQ = 1 is the error threshold. 
At the selective phase one has [25] 

QA-1 
x ° = ~A~T ~ ' 



(2) 



(3) 



and Xi ~ 1/ L d for d<L, where d is the Hamming distance 
from the wild sequence, see Eq. (21) in lfl5ll . We choose the 
sequences with 1 < I < L from the corresponding l-th Ham- 
ming classes. A scaling by Eq.(3) exists also for the rugged 
(Random Energy Model like) fitness landscapes [11]. Scaling 
like the one in Eq. (3) has been applied in models of popula- 
tion genetics with few alleles. In realistic fitness landscapes, 
however, the wild type is present only in a few percents. As- 
suming neutrality, we can attain such scaling: a substantial 
fraction of one mutation neighbors of the wild sequence have 
the same high fitness. Neutrality increases the probability of 
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such mutants and suppresses xq as l° w as 

x ~ 1/VZ. (4) 

This result could be derived easily using Eq. (6) in l26ll . for 
the case when there is a central neutral sequence and large 
fracture of neutral sequences among the neighbor sequences 
of the central sequence. 

In non-selective phase one has 

x ~ l/M, (5) 

where M is the total number of sequences, M = 4 L . 

The error threshold phenomenon closely resembles the 
ferromagnetic -paramagnetic phase transition! 10], where the 
fitness of the system corresponds to the microscopic energy 
of the physics system, the mean fitness of the quasispecies to 
the free energy, and the mutation rate to temperature. To iden- 
tify the different phases in statistical physics one uses the free 
energy and also the order parameters. A phase transition oc- 
curs when, during a change of temperature, the analytical ex- 
pression of the free energy changes. Order parameter changes 
also: while magnetization is non-zero in the ferromagnetic 
phase, it is zero at high temperatures and in the absence of 
a magnetic field. A phase transition in evolution is identi- 
fied by observing the mean fitness R = J^i x i r i an d choos- 
ing proper order parameters, for instance the degree of dis- 
tribution around the master sequence, the surplus production, 
s = x i(l ~ 2doi/£), where doi is the Hamming distance 
from wild type. 

Instead of the 4-letter alphabet of genotypes, we consider 
only two symbols in a genome, the spins "+" and thus 
now M — 2 L [8]. Base substitutions correspond to sign 
changes of the spins. It is particulary easy to analyze land- 
scapes where the fitness values are simple functions of the 
Hamming distance [8]. The N-dimensional sequence space is 
then transformed into a quasi-one-dimensional, linear chain of 
mutant classes I where pi — NiX\ comprises all types with the 
Hamming distance number I from the master and the fitness 
value J i. Ni = w£zjy\ i s tne number of different genotypes 
in the class. The parameter I can be identified as a phenotype 
parameter. 

There is a principal difference between quasi-one dimen- 
sional model, derived rigorously from the initial sequence 
space with 2 N sequences and the one-dimensional one con- 
sidered in [22] and other articles. In contrast to other one- 
dimensional models used earlier 1I23I1 where each class con- 
tains only one type, in our case any class I is composed of 
Ni types and thus retains the connectivity; the Hamming dis- 
tance between two sequences in the same class can take any 
value from to 21. Moreover, when evolution equations are 
formulated for class probabilities, the effective mutation rates 
to the lower class, ~ (L — l)/L, and to the higher class, 
l/L are different and change with / lfl3tl . ll2"7tl . In contrary, 
in the one-dimensional model of [22] these mutation rates are 
1-independent. 

In our quasi-one dimensional model J; can be transformed 
into the f(k) where f(k) is an appropriate smooth function 
with the maximum at k = 1, and /(0) = 1. The "magneti- 
zation" parameter k is defined as k = (1 — 21/ L). A correct 



version of 1 -dimensional evolution model has been suggested 
first in [27], the discrete time version of parallel model with a 
linear fitness. 

Consider now the solution of the Eigen model with sym- 
metric fitness landscape. The mean fitness R for the fitness 
function / has been derived as follows: fl5ll 

i?-max{/(fc)g( 1 - v ™)}|_ 1 < fe < 1 . (6) 

s can be identified from the mean fitness expression using an 
equation 

/(*) = R (7) 

as has been derived in lfl3ll for the parallel model. Thus in 
Eq.(6) the maximum is at some k , an order parameter of the 
system quantifying the bulk spin magnetization, while the sur- 
plus s corresponds to the surface magnetization. Eq. (6) is an 
exact expression (at the infinite genome limit), while in other 
studies fiol l24ll back mutations have been ignored. As shown 
by Tarazona|9], the Eigen model is not equivalent to the sim- 
ple ferromagnetic system of spins in the lattice, but only to 
those spins interacting both inside the bulk of the lattice, and 
on the surface of the lattice. In this work, different phases 
will be characterized by R, the mean fitness, by kg, the bulk 
magnetization, by xq, the fraction of the wild type of the total 
population and by s, the surplus. When fc = 0, resulting in 
s = 0, the population spreads statistically in sequence space, 
indicating a non-selective phase. 

We gave the mean fitness and error threshold (when s in Eq. 
(7) becomes 0) for the symmetric fitness landscape. The point 
is that this transition has also information theoretical meaning. 
Eigen actually found the error threshold from information the- 
oretical consideration of his model. Eigen's idea (information 
theoretical content of a model) resembles the investigation 
of information theoretical (optimal coding) aspects of disor- 
dered systems, developed in statistical physics two decades 
later ll28l l29ll . In the Random Energy Model of spin glass ll30ll 
the phase transition point was derived using the information 
theory analogy 11281 12911 . and was found to yield results cor- 
responding to those derived by Eigen. The deep information 
theoretical meaning of error threshold transition in evolution 
models (equivalent to Shannon inequality for optimal coding) 
is a solid argument that transition like the one by Eq. (2) ex- 
ists for any (irregular, with lethal or neutral mutants) fitness 
landscapes. 

III. WAGNER & KRALL THEOREM. 

Wagner & Krall 1I23I1 considered a population composed of 
the master and an infinite linear chain of mutants, where each 
type mutates only to its next neighbor and the fitness r*j de- 
creases mono tonic ally. When there is no low bound of the fit- 
ness, an absence of the error threshold transition was derived. 
Indeed, when in Eq. 0/(0) = 0, there is no error threshold 
transition. But in more general symmetric fitness landscapes 
with a finite /(0), this ceases to be valid. The proof is as 
follows: the maximum of types are located at the Hamming 
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distance class L/2 or, equivalently, at fc = 0. Consider the 
logarithm of the right hand side in Eq. 6, and expand near 

fc = 0: 

fc 2 



(l- V / l-fc 2 )lnQ + ln/(fc) « -u— +ln(/(0)) + cfc e (8) 

where c and e are parameters describing the function /(fc), and 
In /(fc) — In /(0) ~ cfc £ at fc — > 0. When the fitness decreases 
slowly and so e < 2, Eq.[8]has a maximum at fc > 0, fulfilling 
the condition for selection. When e > 2, it can be demon- 
strated that there is a maximum at fc = for a sufficiently low 
reproduction fidelity Q, therefore a sharp error threshold tran- 
sition results. In the too simplistic model of Wagner & Krall, 
the right hand side of Eq.[8]lacks the quadratic term, resulting 
in a monotonic function of fc and the absence of phase tran- 
sition. In the Eigen model the quadratic term holds, breaking 
the monotonic character of R in Eq.[8]and invoking the error 
threshold. 



IV. TRUNCATED SINGLE PEAK FITNESS LANDSCAPE. 

Let us consider a symmetric fitness landscape, where there 
is non-zero fitness only to some Hamming distance from the 
reference sequence. Here we define the truncated landscape as 
a single-peak one where all sequences beyond the Hamming 
distance d = L(l — K)/2 are lethal: 



n 



Now we have 



1 1 < I < d, 



M = 



n 



I > d 



LI 



d\{L-d)\ 



(9) 



(10) 



non-lethal sequences. 

To define the mean fitness, we compare the expression of 
Eq. (6) inside the region K < k < 1 and at the border. 

The investigation of this model is instructive, see Figs. 1-4. 
When QA > 1 the phase is selective (phase I), xo is given by 
Eq.(3), ko = l,R = QA, see & 
When 



1 > QA > Q 1 



Wl-K 2 



a new phase II prevails with 

fco = 1,R = QA 



(11) 



(12) 



In the II phase xq is decreasing exponentially with L. The ex- 
pression of xq is calculated in the appendix, 



,i ! !^d + J^) 2 -1 + m 2 

x = cxp[ / dm- In — — ] 

J k * i + m 



(13) 



At the the transition point (to the II phase) the expression in 
the exponent is becoming zero, therefore the transition is con- 
tinuous. When 



the maximum is at the border and we have the non-selective 
phase EH, 



fc = K,R = Q 



(15) 



The expression for xq is defined in the appendix, Eq.(A14). 
There is some focusing around reference sequence, and xo is 
higher than 1/M. For K = 0.9 we obtained x m l/M 2/3 . 
The transition between II and III phases is a discontinuous 
one, Xq decreases Mi times, see Eq. (A21) 



f 1 1 

Mi ~ exp[L / dm— In 
Jk 2 



y/l-K 2 -V~r 



K 2 ' 



(16) 



QA<Q 



(14) 



Fig. 4 illustrates different behavior of xq in three phases. 

While in the ordinary Eigen model (without truncation of 
fitness) there is a sharp phase transition with the jump of the 
xo behavior from the Eq.(3) to the 1/M, in truncated case 
these sharp transition is moved to the transition point between 
II and III phases. Now xq is continuous at transition point 
between I and II phases. For the Summers-Litwin case with 
A' = 1 we have Mi — > L therefore the transition disappears 
as has been obtained in lf24ll . and for the K = case we get 
the result of Eigen model Mi = 1/2 N . Our formulas are 
derived for the case K > 0. 

Is phase II a selective one in the ordinary meaning? Clearly, 
its mean fitness is higher than in a typical non-selective phase 
like phase III. This point was clarified by calculating the sur- 
plus, replacing the step-like fitness function near the border- 
line (fc = K) with a smooth function /(fc) which changes its 
value from 1 to near K. In both phases, II and III, with 
QA > Q 1_%/1_ ^ 2 , the majority of the population is near the 
borderline, both on the viable and the lethal side. Therefore, 
while there is a kind of phase transition with some popula- 
tion rearrangement, phase II is identified as an intermediate 
one, with x q <C 1, as in the non-selective phase. Summers 
and Litwin [24] first realized that fco — > 1 in a truncated fit- 
ness landscape and tried to analyze the phenomenon. Unfor- 
tunately, they used too simplistic a model where all mutants 
except the nearest neighbors of the master type were lethal. 
Fig. 2 compares the relative concentration of the master at 
various superiorities in the single-peak fitness landscape de- 
scribed in IH [Toll , with the truncated fitness landscape with 
d = 8, and the case considered by Summers and Litwin. Note 
the strong dependence of the master concentration on the mas- 
ter superiority in the more realistic landscape, leading to an 
error threshold, in contrast to the case in [24]. The population 
profiles of the truncated fitness landscape at different muta- 
tion rates are shown in Fig. 3, showing the transition from a 
master-dominated population via a wide-spread mutant distri- 
bution to a non-selective case. 

In the Table 1 we give the results of numeric for different 
phases. Mean fitness is well confirmed numerically, while the 
accuracy of numerics is poor to get correct values of xq in the 
second phase, 2.37 < A < 2.77. 



How the phases can be identified? The fact is that the param- 
eters fco and s have different meanings in a statistical physics 
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FIG. 1: Phase structure for the model with the overlap parameter 
for truncation point K — 0.9, and mutation rate u = 1. I: selective 
phase with QA > 1, A is the fitness at the peak, Q — e~ u is the 
errorless copying probability of the genome. II: Intermediate phase 

with QA < 1 and QA > Q 1_ ^ 1_if2 . Ill: non-selective phase with 

QA < Q 1 -^^. 



FIG. 2: Semilogarithmic plot of xo versus A for a single peak fitness 
landscape truncated at d > 8 with u = 1, L = 3000. The upper 
curve corresponds to the Summers-Litwin model, the middle curve 
to the truncated fitness landscape as in Fig. 3. The square boxes 
correspond to a single peak fitness model. For ln(A) < 1 Eigen 
model with single peak fitness has xo = 0. 



approach. This subject has been well analyzed in a series of 
articles by E. Baake and her co-authors, s and ko have been 
identified with the transverse and longitudinal magnetizations 
of spins in the corresponding quantum model. We just link s 
with the mean characteristic of the phenotype, and ko- with 
the repertoire of genotypes. The consensus sequence should 
be determined experimentally not only via distribution Xi, but 



FIG. 4: The graphics of In xo /L for the truncated Eigen model at 
infinite L with K — 1/2, u = 1. The I phase (curve BC) is at 
1 < In A/u, the second (curve DE) at 1 < 0.88 < In A/u < 1 
and the third (curve EF)at < In A/u < 0.88. The curve CD 
corresponds to the jump at the border between II and III phases. In 
the first phase \nxo/L ~ 0(1/L). 



Phase 


Ill 


III 


II 


II 


II 


II 


I 


A 


2.2 


2.3 


2.4 


2.5 


2.6 


2.7 


2.8 


R 


0.859 


0.859 


0.883 


0.919 


0.957 


0.993 


1.030 


Rth. 


0.875 


0.875 


0.882 


0.919 


0.956 


0.992 


1.029 


Xo 


4* 10~ 117 


7* 10~ 54 


2 * 10- 24 


4.7* 10~ 16 


io- 9 


4.4 * 10~ 4 


0.0167 


xo th. 














0.0164 



TABLE I: Eigen model with truncated single peak fitness landscape 
for L = 1000, u — 1,K = 1/2. The transition between I and 
second phases is at A m 2.778, between II and III phases at A « 
2.377. 



also via distribution (xi) 2 / ^2j =0 ( x j) 2 ■ Consider 
1 E 7 fe) 2 ' 



(17) 



where Xi belongs to the phenotype class I with Ni different 
genotypes. Having such data, one can simply identify the 
phase structure. 

In the appendix we solve the truncated fitness models for 
the general monotonic smooth function f(x). The numerics 
confirms well our analytical results for the new phase. 
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FIG. 3: Truncated Eigen model with d = 8, A = 3,u = 1,L = 
3000. The distribution of probabilities of Hamming classes pi at dif- 
ferent mutation rates (u values). At high accuracy (circles) the pop- 
ulation groups around the master type, at intermediate accuracy the 
population peaks at the truncation border, at low accuracy the master 
type has practically disappeared and the majority of the progeny is 
lethal. 



V. DISCUSSION 

We rigorously solved (at the infinite genome length limit) 
Eigen's model for the truncated selection using the method 
of IU5I1 as well as methods of statistical mechanics, including 
the analogy of the error threshold to the ferromagnetic-para 
-magnetic transition. This analogy is a complicated critical 
phenomenon, presented by Leuthausser and Tarazona J3, 0] 
and well analyzed by E. Baake and co-authors flU [l3ll . In- 
stead of using only one order parameter to identify the phase 
of the model, magnetization, it is necessary to take into ac- 
count several order parameters describing the order of spins 
in the bulk lattice and at the surface. Recently the existence of 
an error threshold has been questioned ll24ll in the case of trun- 
cated selection. In this model the available sequence space has 
been shrunk to an extremely small size. Fig. 2 illustrates that 
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the unrealistic assumption in ll24ll changed the relative concen- 
tration of the master type by more than 3 orders of magnitude. 
Nevertheless, this work was certainly useful for clarifying the 
concept of quasispecies: the authors first realized the intrigu- 
ing features of a truncated selection landscape. We found a 
new evolution (intermediate) phase, when there is no success- 
ful selection via phenotype trait (the majority does not share 
the trait), while there is some grouping of population at geno- 
type level. The intermediate evolution phase differs from the 
non-selective phase, the frequency of wild type being -7= or 

higher in the intermediate phase compared with ~ -h in the 
non-selective phase of Eigen model. We proposed a parame- 
ter to measure the hidden grouping of population in a geno- 
type level, Eq. (17). Such hidden ordering could be important 
in case of changing environment: it is possible to force the 
whole population to extinction changing the fitness of a small 
fraction (much smaller than 1/L, but much higher than 1 /M, 
M is the number of different genotypes) of viruses in the pop- 
ulation. We recommend virologists to measure the consensus 
sequence not only using the the probabilities Xi, but also xf. 
The evolution picture of the virus population is robust when 



two versions of consensus sequence are close to each other. 
In experiments [21] has been observed an evolution picture, 
qualitatively similar to the intermediate evolution phase. 

How the error threshold transition is connected to the virus 
extinction in virus experiments, is another story. Several 
mechanisms are possible: an error catastrophe, as well as a 
critical mean fitness in order to maintain viral growth rf32tl . In 
this work, we observed the new phase with single peak and 
symmetric landscapes, but this phase exists probably for any 
(including irregular) fitness landscape with a lethal wall in se- 
quence space. 



VI. ACKNOWLEDGMENTS 

D.B. Saakian thanks the Volkswagenstiftung grant "Quan- 
tum Thermodynamics" and the National Center for Theoreti- 
cal Sciences in Taiwan and Academia Sinica (Taiwan) under 
Grant No. AS-95-TP-A07 for the financial support. We thank 
E. Domingo and M.W. Deem discussions. 



[1] E. Domingo, C. K. Biebricher, M. Eigen, and J.J. Holland, 
Quasispecies and RNA Virus evolution: Principles and Con- 
sequences. (Landes Bioscience, Austin, TX, 2001). 

[2] M. Eigen, Proc. Natl. Acad. Sci. USA 99, 13374 (2002). 

[3] M. Eigen, Naturwissenschaften 58, 465 (1971). 

[4] C. K. Biebricher, Cold Spring Harbor Symp. Quant. Biol. 52, 
299 (1987) 

[5] D.R. Mills, R. L. Peterson, and S. Spiegelman, Proc. Natl. 

Acad. Sci. USA 58, 217 (1967). 
[6] J. Swetina, and P. Schuster, Biophys. Chem. 16, 329 (1982). 
[7] I. Leuthausser, J. Stat. Phys. 48, 343 (1987) 
[8] P. Schuster, and J. Swetina, Bull. Math. Biol. 50, 635 (1988). 
[9] P. Tarazona, Phys. Rev. A 45, 6038 (1992). 
[10] M. Eigen, J. S. McCaskill, and P. Schuster, Adv. Chem. Phys. 

75, 149 (1989). 
[1 1] S. Franz, and L. Peliti, J. Phys. A 30, 4481 (1997). 
[12] E. Baake, M. Baake, and H. Wagner, Phys. Rev. Lett. 78, 559 
(1997). 

[13] E. Baake, and H. Wagner, Genet. Res. 78, 93 (2001) 

[14] J. Hermisson, O. Redner, H. Wagner, and E. Baake, Theor. Pop. 

Biol. 62, 9 (2002) 
[15] D. B. Saakian, and C.-K. Hu, Proc. Natl. Acad. Sci. USA 103, 

4935 (2006). 

[16] D. B. Saakian, E. Munoz, C.-K. Hu, and M. W. Deem, Phys. 

Rev. E 73, 041913 (2006). 
[17] J.-M. Park and M. W. Deem, PRL, 98, 058101(2007). 
[18] D.B. Saakian, Journal of statistical physics, 128,781(2007). 
[19] D. B. Saakian, O. Rozanova, and Andrei Akmetzhanov, 

Phys.Rev. E 78, 041908 (2008). 
[20] R. Sanjuan, A. Moya, and S. F. Elena, Proc. Natl. Acad. Sci. 

USA 101, 8396 (2004). 
[21] E. Lazaro, C. Escarmis, J. Perez-Mercader, S. C. Manrubia, and 

E. Domingo, Proc. Natl. Acad. Sci. USA 100, 10830 (2003). 
[22] M. Pariera, M., G. Fernandez, B. Clotet, and M. A. Martinez, 

Mol. Biol. Evol. 24, 382 (2007). 
[23] G. P. Wagner, and P. Krall, J. Math. Biol.32, 33 (1993). 



[24] J. Summers, and M. Litwin, J. Virol. 80, 20 (2006). 

[25] B. Drossel, Biological evolution and statistical physics Ad- 
vances in Physics 50, 209 (2001). 

[26] E.V. Nimwegen, J.P.Cnitchfield, and M. Huynen Proc. Nad. 
Acad. Sci. USA 96, 97 1 6 ( 1 999) . 

[27] H. Woodcock and PG Higgs, J. Theor. Biol. 179:61-73, (1996). 

[28] N. Sourlas, Nature 339, 693 (1989). 

[29] D. B. Saakian, JETP Lett.55,2(1992). 

[30] B. Derrida, Phys. Rev. B 24, 2613 (1981). 

[31] Kimura and Maruyama. Genetics 54,1303(1966) 

[32] J. J. Bull, R. Sanjuan, and C. O. Wilke, J. Virol. 81, 2930 
(2007). 



Appendix A. Application of HJE for truncated symmetric 
landscape. 

Let us apply the Hamilton-Jacobi equation (HJE) method 
lfl8[[l9ll to Eigen model with truncated symmetric fitness land- 
scape. Consider a piecewise smooth, monotonic fitness func- 
tion f(m), 

f(m) = /o(m),m > K, 

f(m) = 0,m<K (A.l) 



Here /o (m) is a monotonic analytical function. 

We denote by fco the maximum point in Eq.(6). As s is de- 
fined by Eq. (7), for the monotonic fitness function we obtain 

s < fc , (A.2) 
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Phase 


III 


III 


II 


II 


II 


I 


I 


I 


c 


0.5 


1 


1.2 


1.3 


1.5 


1.9 


2.1 


3 


ln(R) 


-0.090 


-0.016 


0.0163 


0.035 


0.084 


0.212 


0.287 


0.665 


ln(R) th 


-0.071 


-0.009 


0.0160 


0.034 


0.083 


0.213 


0.288 


0.666 


s 


0.507 


0.507 


0.507 


0.508 


0.510 


0.525 


0.541 


0.663 


s th. 


0.5 


0.5 


0.5 


0.5 


0.500 


0.5 


0.523 


0.666 


— ln(a;o) 


223 


199 


185 


177 


160 


131 


120 


84 



TABLE II: Eigen model with truncated fitness, K = 1/2, L — 
500, u = l,f(x) — exp(cm 2 /2). The transition between I 



and second phases is at c = 2, between II and III phases at 
1.1547. In the I phase we have: s = 
l/c) 2 /2. In the II phase: s = K,ln(R)/u = 
K,ln(R)/u = cK 2 /2 + 



c = 1/VT-K 2 
l/c,ln(R)/u = c(l 
c(l - l/c) 2 /2. In the III phase: s 
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Phase 


III 


III 


III 


II 


II 


II 


I 


I 


c 


0.5 


1 


2.2 


2.4 


2.5 


2.6 


3 


4 


ln(R) 


-0.135 


-0.016 


-0.022 


0.0202 


0.0433 


0.0676 


0.172 


0.459 


ln(R) th 


-0.11 


-0.092 


-0.042 


0.0201 


0.0435 


0.0678 


0.171 


0.460 


s 


0.507 


0.507 


0.510 


0.514 


0.518 


0.523 


0.563 


0.698 


s th. 


0.5 


0.5 


0.5 


0.5 


0.5 


0.5 


0.556 


0.701 


x 


230 


216 


142 


128 


122 


117 


98 


70 



TABLE III: Eigen model with truncated fitness, K = 1/2, L — 
500, u = l,f(x) — exp(cm 3 ). The transition point is at c « 
2.31, mo = 0.866 in the model without truncation. The transition 
point between 11,111 is at c ~ 2.79. 



and there is only one solution of Eq. (7). 
Using an ansatz 

pi = exp[LU (m,t)], (A3) 

to = 1 — 21 /L, in [ 1 8] has been derived the following equation 

dUo(m, t) 



exp{u[cosh(2 



L- 

dU (m,t), 
dm 



dt 



fo(m)e u x 



. m 8^(2^21*) )]} (A .4) 
dm 



The HJE approach works for any piecewise smooth fitness 
functions. Eq. (Al) is such a case. To solve our truncated 
case we should just use different analytical solutions for the 
regions — 1 < m < K and K < m < 1. 

Assuming an asymptotic Ua{m,t) = ~t + [/(to), we de- 
rived £l! 



exp{w[cosh(2 



R = fo(m)e- u x 

dll(m), . , ,dU{m) .„ 

—^-L + msinh2— ^—L )]}, 
dk dm 



(A.5) 



where R is derived by Eq.(6). The surplus s is defined as 
the value of to where U (m) has a maximum. When s is in- 
side the region [K, 1], U'(s) — 0. At extremum points with 
U'(m) = Eq.(A5) gives /(,s) = R. As for a monotonic 
fitness function there is a single solution for Eq. (6), U(x) has 



a single maximum point in this case, therefore it is a concave 
function and we take U(s) = 0. 

We use Eq.(A5) to define pi with an accuracy O(l) for 
Inpi, calculating U(m) = U(s)+ f U'(m)dm for the corre- 
sponding to = 1 — 21/ L. Moreover, it is possible to calculate 
\api with a higher accuracy 0(1/ L). In [18] we gave explicit 
formulas for the case of parallel model. It is possible to con- 
struct similar results for the Eigen model as well. 

We have two branches of solutions for Eq. (A5): 



, 1 q ± y/q 2 - 1 + to 2 
U (to) = - In ^— . 

2 1 + TO 

1 R 
q = — In. - 



u /(to) 



1 



(A.6) 



It is a principal point the choice of different solutions. We 
choose the proper branch assuming: 

• U(m) is continuous function, 

• U'(m) is continuous function, 

• U(m) is a concave function for the monotonic fitness 
function /(to). 

The transition between two branches (± solutions in Eq.(A5)) 
is only at the point where q 2 — 1 + to 2 = or 



R = f{m)e~ u+uV1 - m * 



(A.7) 



According to Eq.(6), R is the maximum of the right hand side. 
Thus we should choose only the branch with " — " sign when 
fco is at the border, fco = K. When k is inside the interval 
[K, 1] , then we choose the " — " solution for the interval [fco , 1] 
and " + " solution in the interval [K, fco]- 

For the V(m) — ln(V Thpi) / L we have another equation 
[18], 



R = /o( m )e u exp{u[cosh(2 



dV(m) 
dm 



)Vl-TO 2 ]} (A.8) 



The minimum of the right hand side via V just gives the 
f (m)e~ u+u ^ 1 ~ m2 . Thus at the maximum point m = fco of 
function V(m) we have V'(ko) = 0. In this article we con- 
sider the case when Eq.(6) has a single solution fc = fco. Then 
V(m) is a concave function. 

Solution of equations (A5),(A7) are simply related,[18], 



V(m) = U(m) + 



(1 + m) ln±±n 



(1 - to) In i=2i 

L A MA.9) 



Consider now different phases of our model. The selective 
one with K < fco < 1, K < s < 1; the non-selective one with 
fco = K,s = K, and intermediate one with K < fco < 1, s = 
K. 

Selective phase. 

Now R is given by Eq. (6) with a K < fco < 1. We used 
" - " solution of Eq.(A6) for fc < to < 1 and the " + " 
solution for K < to < fc . The maximum points of both 
functions U(m) and V(m) are inside the interval [K, 1]. We 
have U'(s) = and V'(ko) = 0. The formulas for the steady 
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state distributions are the same as in lfl8ll . We have a mean 
fitness 



R = /(/c )e- u+tl v T ^ 



(A. 10) 



For the pi,m = L(l — 21/ X), fco < m < 1 we have an ex- 
pression 



= cxp[L / dm- In 

Jko 2 1 + m 



V 7 ? 2 — 1 + ITT- 2 



-L I dm— In < ^—— 

2 1 + m 



g=-ln-^- + l (A.ll) 

u /(to) 



For to < ko we have 



pi = exp[L / dm- In - 

Js 2 1 + m 

Q= -In 7^ + 1 (A.12) 

u /(to) 

Nonselective phase. 

Now the maximum of Eq.(6) is at the border fco = K, and 
we have 



(A. 13) 



We use " — " solution of Eq.(A6) for the whole interval K < 
to < 1. For the pi, to = 1(1 — 21/ L) we have an expression 



pi = exp[L / dm — In 



A' 



1 + TO 

1 R 
q = -\n— — + 1 (A.14) 



u /(to) 

and the maximum is for pa with to = 1 ~ 2d = If. 



for to < 1), we have q = vT — X 2 and 



For the single peak fitness case (/(l) = A and /(to) = 1 

+A.15) 



a^o = Po ~ exp — / dm In — 

2 J K 1 + TO 



For the 1/M we have an expression 

Ik 



1 r L f 1 1 — TO. 

— -exp-/ dm In—— (A.16) 

M 2 y K l + to 



Intermediate phase. Now mean fitness is given by Eq.(6) 
with some K < fc < 1 and s = K. We used " — " solution of 
Eq.(A6) for fco < to < 1 and " + " solution for K < to < fco. 
When to > fco, we have 



fco 



Pi = exp[L / dm- In ■ 

J K 2 1 + 771 



L / dm— In 



1 , g - VV - 1 + ' 



1 + TO 



(A. 17) 



In case of X < to < fco we have 



Pi = exp|Z / dm — In - 

ja ^ 1 



(A. 18) 



We took {/ (K) — 0, as the maximum of population is at the 
border with to = K. 

For the SP case we have q = MA and 



/ 1 

xq = cxp[L / dm— In 
Jk 2 



1 , MA + ,/ 1^)2 _ I + m 2 



1 + m 



<A.19) 



Above the transition point In A = vT — if 2 Eq.(19) gives 



r f 1 , 1, Vl-^ 2 + Vm 2 -X 2 1/A „ m 

Xo = exp LL / dm- In (A. 20) 

Jk 2 1 + m 

We see that at the transition point there is a jump, xq decreases 
Mi times, 



f 1 1 

Mi — exp[L / dm- In 

Jk 2 



1, Vl - K 2 + yjm 2 - K 2 



VI - K 2 - 



K 2 



](A.21) 



For K = 0.5 Eq.(A20) gives ln(a; )/£ « -0.057, while 
ln(M)/L « —0.56. Thus above the transition point to the 
third phase 



Log(x a ) 



1 



AfO.i 



(A.22) 



Consider the case K = 0.99. Now we have \n(xo)/L 
-0.01 and ln(M)/i » -0.031. Thus 



ln(x ) 



1 



A/0.32 



(A.23) 



Consider evolution model with general fitness function f{x). 
Assume that without truncation the error threshold transition 
is a discontinuous one, and there is a jump from non-zero fco > 
in selective phase to fc = solution in Eq.(6) for non-selective 
phase. Let us introduce the truncation. Choosing K < fco, we 
have three phases, see Table 3, and xq decreases AI2 times at 
the transition point between II and III phases, 



M 3 ~ e * p[i /' 4»i in ^IZS + ^Z5 (A.24) 



2 Vl - K 2 - Vm 2 - K 2 



If in the original (without truncation) model the error thresh- 
old transition is a continuous one with fco = 0, after truncation 
we have different expressions for xq in the II (fco > K,s — 
K) and III (fco — K,s = K) phases while continuous transi- 
tions I — > II and II — > III, see Table 2. We have a similar 
behavior for the phase transitions in case of originally (with- 
out truncation) discontinuous error threshold transition, if the 
truncation parameter K is chosen too large, K > ko- 

Let us derive an important constraint for the population of 
the class at the Hamming distance 77 = L(l — fco)/2. For the 
corresponding V we have 



V(fto) 



iln-p^ 



(A.25) 
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As pd — 1 (the majority of population is at the border with the The last inequality supports the choice of order parameter in 
overlap parameter K = (1 — 2d) /L), we have Eq. (17). 

V(K) 



1 , 1 

— m —=, 

L VM 



(A.26) 



For the single peak case n = 0, and we get from Eq.(A28) 



as M — Nk- We proved before that V(m) has a single max- 
imum (in our case with a single solution for the maximum 
point k in Eq. (6)). Thus 



V(K) > V(k ) 



(A.27) 



which gives 



Pn > 



M 



M 



(A.29) 



(A. 28) Eqs.(A22,A23) give even higher values for xq. 



